Attainability of boundary points under reinforcement learning
نویسندگان
چکیده
This paper investigates the properties of the most common form of reinforcement learning (the “basic model” of Erev and Roth, American Economic Review, 88, 848-881, 1998). Stochastic approximation theory has been used to analyse the local stability of fixed points under this learning process. However, as we show, when such points are on the boundary of the state space, for example, pure strategy equilibria, standard results from the theory of stochastic approximation do not apply. We offer what we believe to be the correct treatment of boundary points, and provide a new and more general result: this model of learning converges with zero probability to fixed points which are unstable under the Maynard Smith or adjusted version of the evolutionary replicator dynamics. For two player games these are the fixed points that are linearly unstable under the standard replicator dynamics. Journal of Economic Literature classification numbers: C72, C73, D83
منابع مشابه
Nonconvergence to saddle boundary points under perturbed reinforcement learning
For several classes of reinforcement learning schemes, convergence to action profiles that are not Nash equilibria may occur with positive probability under certain conditions on the payoff function. In this paper, we explore how an alternative reinforcement learning scheme, where the strategy of each agent is also perturbed by a strategy-dependent perturbation (or mutations) function, may excl...
متن کاملEffect of Debonding of Rebars on the Seismic Response of Boundary Elements of Lightly Reinforced Shear Walls
Rebar fracture in boundary elements of lightly reinforced shear walls in recent earthquake motivated research on the minimum longitudinal reinforcement applicable to shear walls. These researches lead to change in the ACI 318-19 requirement for minimum longitudinal reinforcement in boundary elements. New ACI 318 requirement increase minimum longitudinal reinforcement ratio for boundary elements...
متن کاملDistributed Dynamic Reinforcement of Efficient Outcomes in Multiagent Coordination and Network Formation
We analyze reinforcement learning under so-called “dynamic reinforcement”. In reinforcement learning, each agent repeatedly interacts with an unknown environment (i.e., other agents), receives a reward, and updates the probabilities of its next action based on its own previous actions and received rewards. Unlike standard reinforcement learning, dynamic reinforcement uses a combination of long ...
متن کاملDynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملAn Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic
This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Games and Economic Behavior
دوره 53 شماره
صفحات -
تاریخ انتشار 2005